SEQUENCE LISTING



(1) GENERAL INFORMATION 

(i) APPLICANT: BILLING-MEDEL, PATRICIA A.
COHEN, MAURICE 
COLPITTS, TRACEY L.
FRIEDMAN, PAULA N. 
HAYDEN, MARK
KLASS, MICHAEL R. 
ROBERTS-RAPP, LISA
RUSSELL, JOHN C. 
STROUPE, STEPHEN D. 



(ii) TITLE OF THE INVENTION: REAGENTS AND METHODS FOR THE

USEFUL FOR DETECTING DISEASES OF THE GASTROINTESTINAL

TRACT

(iii) NUMBER OF SEQUENCES: 51

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Abbott Laboratories

(B) STREET: 100 Abbott Park Road

(C) CITY: Abbott Park 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 08/828,856 

(B) FILING DATE: 31-MAR-1997 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Becker, Cheryl L. 

(B) REGISTRATION NUMBER: 35,441

(C) REFERENCE/DOCKET NUMBER: 6068.US.P1

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 847/935-1729

(B) TELEFAX: 847/938-2623
(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GCCAGGAATA ACTAGAGAGG AACAATGGGG TTATTCAGAG GTTTTGTTTT CCTCTTAGTT 60 

CTGTGCCTGC TGCACCAGTC AAATACTTCC TTCATTAAGC TGAATAATAA TGGCTTTGAA 120 

GATATTGTCA TTGTTATAGA TCCTAGTGTG CCAGAAGATG AAAAAATAAT TGAACAAATA 180 

GAGGATATGG TGACTACAGC TTCTACGTAC CTGTTTGAAG CCACAGAAAA AAGATTTTTT 240 

T 241 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CTAGAGAGGA ACAATGGGGT TATTCAGAGG TTTTGTTTTC CTCTTAGTTC TGTGCCTGCT 60 

GCACCAGTCA AATACTTCCT TCATTAAGCT GAATAATAAT GGCTTTGAAG ATATTGTCAT 120 

TGTTATAGAT CCTAGTGTGC CAGAAGATGA AAAAATAATT GAACAAATAG AGGATATGGT 180

GACTACAGCT TCTACGTACC TGTTTGAAGC CACAGAAAA 219 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 3

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TTNTGTAACG AAAAAACCCA TAATCAAGAA GCTCCAAGCC TACAAAACAT AAAGTGCAAT 60

TTTAGAAGTA CATGGGAGGT GATTAGCAAT TCTGAGGATT TTAAAAACAC CATACCCATG 120

GTGACACCAC CTCCTCCACC TGTCTTCTCA TTGCTGAAGA TCAGTCAAAG AATTGTGTGC 180

TTAGTTCTTG ATAAGTCTGG AAGCATGGGG GGTAAGGACC GCCTAAATCG A 231

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 237 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TGGGGGGTAA GGACCGCCTA AATCGAATGA ATCAAGCAGC AAAACATTTC CTGCTGCAGA 60

CTGTTGAAAA TGGATCCTGG GTGGGGATGG TTCACTTTGA TAGTACTGCC ACTATTGTAA 120 

ATAAGCTAAT CCAAATAAAA AGCAGTGATG AAAGAAACAC ACTCATGGCA GGATTACCTA 180 

CATATCCTCT GGGAGGAACT TCCATCTGCT CTGGAATTAA ATATGCATTT CAGGTGA 237 



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CTTCCATCTG CTCTGGAATT AAATATGCAT TTCAGGTGAT TGGAGAGCTA CATTCCCAAC 60 

TCGATGGATC CGAAGTACTG CTGCTGACTG ATGGGGAGGA TAACACTGCA AGTTCTTGTA 120 

TTGATGAAGT GAAACAAAGT GGGGCCATTG TTCATTTTAT TGCTTTGGGA AGAGCTGCTG 180 

ATGAAGCAGT AATAGAGATG AGCAAGATAA CAGGAG 216 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 24

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AATTGATAGT ACAGTGGGAA AGGNCACGTT CTTTCTCATC ACATGGAACA GTCTGCCTCC 60 

CAGTATTTCT CTCTGGGATC CCAGTGGAAC AATAATGGAA AATTTCACAG TGGATGCAAC 120 

TTCCAAAATG GCCTATCTCA GTATTCCAGG AACTGCAAAG GTGGGCACTT GGGCATACAA 180 

TCTTCAAGCC AAAGCGAACC C 201 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCAAATTCTT CTGTGCCTCC AATCACAGTG AATGCTAAAA TGAATAAGGA CGTAAACAGT 60 

TTCCCCAGCC CAATGATTGT TTACGCAGAA ATTCTACAAG GATATGTACC TGTTCTTGGA 120 

GCCAATGTGA CTGCTTTCAT TGAATCACAG AATGGACATA CAGAAGTTTT GGAACTTTTG 180 

GATAATGGTG CAGGCGCTGA TTCTTTCAAG AATGATGGAG TCTACTCCAG GTATTTTACA 240 

G 241

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 242 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GTGCAGGCGC TGATTCTTTC AAGAATGATG GAGTCTACTC CAGGTATTTT ACAGCATATA 60 

CAGAAAATGG CAGATATAGC TTAAAAGTTC GGGCTCATGG AGGAGCAAAC ACTGCCAGGC 120

TAAAATTACG GCCTCCACTG AATAGAGCCG CGTACATACC AGGCTGGGTA GTGAACGGGG 180 

AAATTGAAGC AAACCCGCCA AGACCTGAAA TTGATGAGGA TACTCAGACC ACCTTGGAGG 240 

AT 242 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

CCCGCCAAGA CCTGAAATTG ATGAGGATAC TCAGACCACC TTGGAGGATT TCAGCCGAAC 60

AGCATCCGGA GGTGCATTTG TGGTATCACA AGTCCCAAGC CTTCCCTTGC CTGACCAATA 120

CCCACCAAGT CAAATCACAG ACCTTGATGC CACAGTTCAT GAGGATAAGA TTATTCTTAC 180

ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT CAACGTTATA TCA 233



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 313 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 22

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 44

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCAATACCCA CCAAGTCAAA TNACAGACCT TGATGCCACA GTTNATGAGG ATAAGATTAT 60 

TCTTACATGG ACAGCACCAG GAGATAATTT TGATGTTGGA AAAGTTCAAC GTTATATCAT 120 

AAGAATAAGT GCAAGTATTC TTGATCTAAG AGACAGTTTT GATGATGCTC TTCAAGTAAA 180 

TACTACTGAT CTGTCACCAA AGGAGGCCAA CTCCAAGGAA AGCTTTGCAT TTAAACCAGA 240 

AAATATCTCA GAAGAAAATG CAACCCACAT ATTTATTGCC ATTAAAAGTA TAGATAAAGC 300

ATTTGGCATC AAA 313 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:



AAGTATTCTT GATCTAAGAG ACAGTTTTGA TGATGCTCTT CAAGTAAATA CTACTGATCT 60 

GTCACCAAAG GAGGCCAACT CCAAGGAAAG CTTTGCATTT AAACCAGAAA ATATCTCAGA 120 

AGAAAATGCA ACCCACATAT TTATTGCCAT TAAAAGTATA GATAAAAGCA ATTTGACATC 180 

AAAAGTATCC AACATTGCAC AAGTAACTTT GTTTATCCCT CAAGCAAATC CTGATGACAT 240

TG 242 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 2

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 4

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

ANANAATGCA ACCCACATAT TTATTGCCAT TAAAAGTATA GATAAAAGCA ATTTGACATC 60 

AAAAGTATCC AACATTGCAC AAGTAACTTT GTTTATCCCT CAAGCAAATC CTGATGACAT 120 

TGATCCTACT CCTACTCCTA CTCCTACTCC TGATAAAAGT CATAATTCTG GAGTTAATAT 180 

TTCTACGCTG GTATTGTCTG TGATTGGG 208 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

CTCCTACTCC TACTCCTGAT AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGTAT 60 

TGTCTGTGAT TGGGTCTGTT GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTGAACCT 120 

TAACGAAGAA AAAAATCTTC AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT 180 

AAGTAAAGGA TATTTCTGAA T 201 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 111

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 244

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(ix) FEATURE:

(A) NAME/KEY: base_polymorphism

(B) LOCATION: 284

(D) OTHER INFORMATION: /note= "'N' represents an A or G or
T or C polymorphism at this position"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:



TCTGTTGTAA TTGTTAACTT TATTTTAAGT ACCACCATTT GAACCTTAAC GAAGAAAAAA 60 

ATCTTCAAGT AGACCTAGAA GAGAGTTTTA AAAAACAAAA CAATGTAAGT NAAGGATATT 120 

TCTGAATCTT AAAATTCATC CCATGTGTGA TCATAAACTC ATAAAAATAA TTTTAAGATG 180 

TCGGAAAAGG ATACTTTGAT TAAATAAAAA CACTCATGGA TATGTAAAAA CTGTCAAGAT 240 

TAANATTTAA TAGTTTCATT TATTTGTTAT TTTATTTGTA AGANATAGTG ATGAACAAAG 300

A 301

(2) INFORMATION FOR SEQ ID NO: 15:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 229 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAAAAAAATC TTCAAGTAGA CCTAGAAGAG AGTTTTAAAA AACAAAACAA TGTAAGTAAA 60 

GGATATTTCT GAATCTTAAA ATTCATCCCA TGTGTGATCA TAAACTCATA AAAATAATTT 120

TAAGATGTCG GAAAAGGATA CTTTGATTAA ATAAAAACAC TCATGGATAT GTAAAAACTG 180 

TCAAGATTAA AATTTAATAG TTTCATTTAT TTGTTATTTT ATTTGTAAG 229 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3043 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTAGAGAGGA ACAATGGGGT TATTCAGAGG TTTTGTTTTC CTCTTAGTTC TGTGCCTGCT 60 

GCACCAGTCA AATACTTCCT TCATTAAGCT GAATAATAAT GGCTTTGAAG ATATTGTCAT 120 

TGTTATAGAT CCTAGTGTGC CAGAAGATGA AAAAATAATT GAACAAATAG AGGATATGGT 180 

GACTACAGCT TCTACGTACC TGTTTGAAGC CACAGAAAAA AGATTTTTTT TCAAAAATGT 240 

ATCTATATTA ATTCCTGAGA ATTGGAAGGA AAATCCTCAG TACAAAAGGC CAAAACATGA 300 

AAACCATAAA CATGCTGATG TTATAGTTGC ACCACCTACA CTCCCAGGTA GAGATGAACC 360 

ATACACCAAG CAGTTCACAG AATGTGGAGA GAAAGGCGAA TACATTCACT TCACCCCTGA 420 

CCTTCTACTT GAAAAAAAAC AAAATGAATA TGGACCACCA GGCAAACTGT TTGTCCATGA 480

GTGGGCTCAC CTCCGGTGGG GAGTGTTTGA TGAGTACAAT GAAGATCAGC CTTTCTACCG 540

TGCTAAGTCA AAAAAAATCG AAGCAACAAG GTGTTCCGCA GGTATCTCTG GTAGAAATAG 600

AGTTTATAAG TGTCAAGGAG GCAGCTGTCT TAGTAGAGCA TGCAGAATTG ATTCTACAAC 660

AAAACTGTAT GGAAAAGATT GTCAATTCTT TCCTGATAAA GTACAAACAG AAAAAGCATC 720

CATAATGTTT ATGCAAAGTA TTGATTCTGT TGTTGAATTT TGTAACGAAA AAACCCATAA 780

TCAAGAAGCT CCAAGCCTAC AAAACATAAA GTGCAATTTT AGAAGTACAT GGGAGGTGAT 840

TAGCAATTCT GAGGATTTTA AAAACACCAT ACCCATGGTG ACACCACCTC CTCCACCTGT 900

CTTCTCATTG CTGAAGATCA GTCAAAGAAT TGTGTGCTTA GTTCTTGATA AGTCTGGAAG 960

CATGGGGGGT AAGGACCGCC TAAATCGAAT GAATCAAGCA GCAAAACATT TCCTGCTGCA 1020

GACTGTTGAA AATGGATCCT GGGTGGGGAT GGTTCACTTT GATAGTACTG CCACTATTGT 1080

AAATAAGCTA ATCCAAATAA AAAGCAGTGA TGAAAGAAAC ACACTCATGG CAGGATTACC 1140

TACATATCCT CTGGGAGGAA CTTCCATCTG CTCTGGAATT AAATATGCAT TTCAGGTGAT 1200

TGGAGAGCTA CATTCCCAAC TCGATGGATC CGAAGTACTG CTGCTGACTG ATGGGGAGGA 1260

TAACACTGCA AGTTCTTGTA TTGATGAAGT GAAACAAAGT GGGGCCATTG TTCATTTTAT 1320

TGCTTTGGGA AGAGCTGCTG ATGAAGCAGT AATAGAGATG AGCAAGATAA CAGGAGGAAG 1380

TCATTTTTAT GTTTCAGATG AAGCTCAGAA CAATGGCCTC ATTGATGCTT TTGGGGCTCT 1440

TACATCAGGA AATACTGATC TCTCCCAGAA GTCCCTTCAG CTCGAAAGTA AGGGATTAAC 1500

ACTGAATAGT AATGCCTGGA TGAACGACAC TGTCATAATT GATAGTACAG TGGGAAAGGA 1560

CACGTTCTTT CTCATCACAT GGAACAGTCT GCCTCCCAGT ATTTCTCTCT GGGATCCCAG 1620

TGGAACAATA ATGGAAAATT TCACAGTGGA TGCAACTTCC AAAATGGCCT ATCTCAGTAT 1680

TCCAGGAACT GCAAAGGTGG GCACTTGGGC ATACAATCTT CAAGCCAAAG CGAACCCAGA 1740

AACATTAACT ATTACAGTAA CTTCTCGAGC AGCAAATTCT TCTGTGCCTC CAATCACAGT 1800

GAATGCTAAA ATGAATAAGG ACGTAAACAG TTTCCCCAGC CCAATGATTG TTTACGCAGA 1860

AATTCTACAA GGATATGTAC CTGTTCTTGG AGCCAATGTG ACTGCTTTCA TTGAATCACA 1920

GAATGGACAT ACAGAAGTTT TGGAACTTTT GGATAATGGT GCAGGCGCTG ATTCTTTCAA 1980

GAATGATGGA GTCTACTCCA GGTATTTTAC AGCATATACA GAAAATGGCA GATATAGCTT 2040

AAAAGTTCGG GCTCATGGAG GAGCAAACAC TGCCAGGCTA AAATTACGGC CTCCACTGAA 2100

TAGAGCCGCG TACATACCAG GCTGGGTAGT GAACGGGGAA ATTGAAGCAA ACCCGCCAAG 2160

ACCTGAAATT GATGAGGATA CTCAGACCAC CTTGGAGGAT TTCAGCCGAA CAGCATCCGG 2220

AGGTGCATTT GTGGTATCAC AAGTCCCAAG CCTTCCCTTG CCTGACCAAT ACCCACCAAG 2280

TCAAATCACA GACCTTGATG CCACAGTTCA TGAGGATAAG ATTATTCTTA CATGGACAGC 2340

ACCAGGAGAT AATTTTGATG TTGGAAAAGT TCAACGTTAT ATCATAAGAA TAAGTGCAAG 2400

TATTCTTGAT CTAAGAGACA GTTTTGATGA TGCTCTTCAA GTAAATACTA CTGATCTGTC 2460

ACCAAAGGAG GCCAACTCCA AGGAAAGCTT TGCATTTAAA CCAGAAAATA TCTCAGAAGA 2520

AAATGCAACC CACATATTTA TTGCCATTAA AAGTATAGAT AAAAGCAATT TGACATCAAA 2580

AGTATCCAAC ATTGCACAAG TAACTTTGTT TATCCCTCAA GCAAATCCTG ATGACATTGA 2640

TCCTACTCCT ACTCCTACTC CTACTCCTGA TAAAAGTCAT AATTCTGGAG TTAATATTTC 2700

TACGCTGGTA TTGTCTGTGA TTGGGTCTGT TGTAATTGTT AACTTTATTT TAAGTACCAC 2760

CATTTGAACC TTAACGAAGA AAAAAATCTT CAAGTAGACC TAGAAGAGAG TTTTAAAAAA 2820

CAAAACAATG TAAGTAAAGG ATATTTCTGA ATCTTAAAAT TCATCCCATG TGTGATCATA 2880

AACTCATAAA AATAATTTTA AGATGTCGGA AAAGGATACT TTGATTAAAT AAAAACACTC 2940

ATGGATATGT AAAAACTGTC AAGATTAAAA TTTAATAGTT TCATTTATTT GTTATTTTAT 3000

TTGTAAGAAA TAGTGATGAA CAAAGATCCT TTTTCATACT GAT 3043



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GCAAATTCTT CTGTGCCTCC AATCACAGTG AATGCTAAAA TGAATAAGGA CGTAAACAGT 60

TTCCCCAGCC CAATGATTGT TTACGCAGAA ATTCTACAAG GATATGTACC TGTTCTTGGA 120

GCCAATGTGA CTGCTTTCAT TGAATCACAG AATGGACATA CAGAAGTTTT GGAACTTTTG 180

GATAATGGTG CAGGCGCTGA TTCTTTCAAG AATGATGGAG TCTACTCCAG GTATTTTACA 240

GCATATACAG AAAATGGCAG ATATAGCTTA AAAGTTCGGG CTCATGGAGG AGCAAACACT 300

GCCAGGCTAA AATTACGGCC TCCACTGAAT AGAGCCGCGT ACATACCAGG CTGGGTAGTG 360

AACGGGGAAA TTGAAGCAAA CCCGCCAAGA CCTGAAATTG ATGAGGATAC TCAGACCACC 420

TTGGAGGATT TCAGCCGAAC AGCATCCGGA GGTGCATTTG TGGTATCACA AGTCCCAAGC 480

CTTCCCTTGC CTGACCAATA CCCACCAAGT CAAATCACAG ACCTTGATGC CACAGTTCAT 540

GAGGATAAGA TTATTCTTAC ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT 600

CAACGTTATA TCATAAGAAT AAGTGCAAGT ATTCTTGATC TAAGAGACAG TTTTGATGAT 660

GCTCTTCAAG TAAATACTAC TGATCTGTCA CCAAAGGAGG CCAACTCCAA GGAAAGCTTT 720

GCATTTAAAC CAGAAAATAT CTCAGAAGAA AATGCAACCC ACATATTTAT TGCCATTAAA 780

AGTATAGATA AAAGCAATTT GACATCAAAA GTATCCAACA TTGCACAAGT AACTTTGTTT 840

ATCCCTCAAG CAAATCCTGA TGACATTGAT CCTACTCCTA CTCCTACTCC TACTCCTGAT 900

AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGTAT TGTCTGTGAT TGGGTCTGTT 960

GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTGAACCT TAACGAAGAA AAAAATCTTC 1020

AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT AAGTAAAGGA TATTTCTGAA 1080

TCTTAAAATT CATCCCATGT GTGATCATAA ACTCATAAAA ATAATTTTAA GATGTCGGAA 1140

AAGGATACTT TGATTAAATA AAAACACTCA TGGATATGTA AAAACTGTCA AGATTAAAAT 1200 

TTAATAGTTT CATTTATTTG TTATTTTATT TGTAAGAAAT AGTGATGAAC AAAGATCCTT 1260 

TTTCATACTG ATACCTGGTT GTATATTATT TGATGCAACA GTTTTCTGAA ATGATATTTC 1320 

AAATTGCATC AAGAAATTAA AATCATCTAT CTGAGTAGTC AAAATACAAG TAAAGGAGAG 1380 

CAAATAAACA ACATTTGGA 1399 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GCCAGGAATA ACTAGAGAGG AACAATGGGG TTATTCAGAG GTTTTGTTTT CCTCTTAGTT 60 

CTGTGCCTGC TGCACCAGTC AAATACTTCC TTCATTAAGC TGAATAATAA TGGCTTTGAA 120

GATATTGTCA TTGTTATAGA TCCTAGTGTG CCAGAAGATG AAAAAATAAT TGAACAAATA 180

GAGGATATGG TGACTACAGC TTCTACGTAC CTGTTTGAAG CCACAGAAAA AAGATTTTTT 240

TTCAAAAATG TATCTATATT AATTCCTGAG AATTGGAAGG AAAATCCTCA GTACAAAAGG 300

CCAAAACATG AAAACCATAA ACATGCTGAT GTTATAGTTG CACCACCTAC ACTCCCAGGT 360

AGAGATGAAC CATACACCAA GCAGTTCACA GAATGTGGAG AGAAAGGCGA ATACATTCAC 420

TTCACCCCTG ACCTTCTACT TGAAAAAAAA CAAAATGAAT ATGGACCACC AGGCAAACTG 480

TTTGTCCATG AGTGGGCTCA CCTCCGGTGG GGAGTGTTTG ATGAGTACAA TGAAGATCAG 540

CCTTTCTACC GTGCTAAGTC AAAAAAAATC GAAGCAACAA GGTGTTCCGC AGGTATCTCT 600

GGTAGAAATA GAGTTTATAA GTGTCAAGGA GGCAGCTGTC TTAGTAGAGC ATGCAGAATT 660

GATTCTACAA CAAAACTGTA TGGAAAAGAT TGTCAATTCT TTCCTGATAA AGTACAAACA 720

GAAAAAGCAT CCATAATGTT TATGCAAAGT ATTGATTCTG TTGTTGAATT TTGTAACGAA 780

AAAACCCATA ATCAAGAAGC TCCAAGCCTA CAAAACATAA AGTGCAATTT TAGAAGTACA 840

TGGGAGGTGA TTAGCAATTC TGAGGATTTT AAAAACACCA TACCCATGGT GACACCACCT 900

CCTCCACCTG TCTTCTCATT GCTGAAGATC AGTCAAAGAA TTGTGTGCTT AGTTCTTGAT 960

AAGTCTGGAA GCATGGGGGG TAAGGACCGC CTAAATCGAA TGAATCAAGC AGCAAAACAT 1020

TTCCTGCTGC AGACTGTTGA AAATGGATCC TGGGTGGGGA TGGTTCACTT TGATAGTACT 1080

GCCACTATTG TAAATAAGCT AATCCAAATA AAAAGCAGTG ATGAAAGAAA CACACTCATG 1140

GCAGGATTAC CTACATATCC TCTGGGAGGA ACTTCCATCT GCTCTGGAAT TAAATATGCA 1200 

TTTCAGGTGA TTGGAGAGCT ACATTCCCAA CTCGATGGAT CCGAAGTACT GCTGCTGACT 1260 

GATGGGGAGG ATAACACTGC AAGTTCTTGT ATTGATGAAG TGAAACAAAG TGGGGCCATT 1320

GTTCATTTTA TTGCTTTGGG AAGAGCTGCT GATGAAGCAG TAATAGAGAT GAGCAAGATA 1380 

ACAGGAGGAA GTCATTTTTA TGTTTCAGAT GAAGCTCAGA ACAATGGCCT CATTGATGCT 1440 

TTTGGGGCTC TTACATCAGG AAATACTGAT CTCTCCCAGA AGTCCCTTCA GCTCGAAAGT 1500 

AAGGGATTAA CACTGAATAG TAATGCCTGG ATGAACGACA CTGTCATAAT TGATAGTACA 1560 

GTGGGAAAGG ACACGTTCTT TCTCATCACA TGGAACAGTC TGCCTCCCAG TATTTCTCTC 1620 

TGGGATCCCA GTGGAACAAT AATGGAAAAT TTCACAGTGG ATGCAACTTC CAAAATGGCC 1680 

TATCTCAGTA TTCCAGGAAC TGCAAAGGTG GGCACTTGGG CATACAATCT TCAAGCCAAA 1740

GCGAACCCAG AAACATTAAC TATTACAGTA ACTTCTCGAG CAGCAAATTC TTCTGTGCCT 1800

CCAATCACAG TGAATGCTAA AATGAATAAG GACGTAAACA GTTTCCCCAG CCCAATGATT 1860

GTTTACGCAG AAATTCTACA AGGATATGTA CCTGTTCTTG GAGCCAATGT GACTGCTTTC 1920

ATTGAATCAC AGAATGGACA TACAGAAGTT TTGGAACTTT TGGATAATGG TGCAGGCGCT 1980

GATTCTTTCA AGAATGATGG AGTCTACTCC AGGTATTTTA CAGCATATAC AGAAAATGGC 2040

AGATATAGCT TAAAAGTTCG GGCTCATGGA GGAGCAAACA CTGCCAGGCT AAAATTACGG 2100

CCTCCACTGA ATAGAGCCGC GTACATACCA GGCTGGGTAG TGAACGGGGA AATTGAAGCA 2160 

AACCCGCCAA GACCTGAAAT TGATGAGGAT ACTCAGACCA CCTTGGAGGA TTTCAGCCGA 2220 

ACAGCATCCG GAGGTGCATT TGTGGTATCA CAAGTCCCAA GCCTTCCCTT GCCTGACCAA 2280 

TACCCACCAA GTCAAATCAC AGACCTTGAT GCCACAGTTC ATGAGGATAA GATTATTCTT 2340

ACATGGACAG CACCAGGAGA TAATTTTGAT GTTGGAAAAG TTCAACGTTA TATCATAAGA 2400

ATAAGTGCAA GTATTCTTGA TCTAAGAGAC AGTTTTGATG ATGCTCTTCA AGTAAATACT 2460

ACTGATCTGT CACCAAAGGA GGCCAACTCC AAGGAAAGCT TTGCATTTAA ACCAGAAAAT 2520

ATCTCAGAAG AAAATGCAAC CCACATATTT ATTGCCATTA AAAGTATAGA TAAAAGCAAT 2580

TTGACATCAA AAGTATCCAA CATTGCACAA GTAACTTTGT TTATCCCTCA AGCAAATCCT 2640

GATGACATTG ATCCTACTCC TACTCCTACT CCTACTCCTG ATAAAAGTCA TAATTCTGGA 2700

GTTAATATTT CTACGCTGGT ATTGTCTGTG ATTGGGTCTG TTGTAATTGT TAACTTTATT 2760

TTAAGTACCA CCATTTGAAC CTTAACGAAG AAAAAAATCT TCAAGTAGAC CTAGAAGAGA 2820

GTTTTAAAAA ACAAAACAAT GTAAGTAAAG GATATTTCTG AATCTTAAAA TTCATCCCAT 2880

GTGTGATCAT AAACTCATAA AAATAATTTT AAGATGTCGG AAAAGGATAC TTTGATTAAA 2940

TAAAAACACT CATGGATATG TAAAAACTGT CAAGATTAAA ATTTAATAGT TTCATTTATT 3000 

TGTTATTTTA TTTGTAAGAA ATAGTGATGA ACAAAGATCC TTTTTCATAC TGATACCTGG 3060 

TTGTATATTA TTTGATGCAA CAGTTTTCTG AAATGATATT TCAAATTGCA TCAAGAAATT 3120 

AAAATCATCT ATCTGAGTAG TCAAAATACA AGTAAAGGAG AGCAAATAAA CAACATTTGG 3180 

A 3181 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGCTCGGAAT TCCGAGCTTG GATCCTCTAG AGCGGCCGCC GACTAGTGAG CTCGTCGACC 60 
CGGGAATT 68 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AATTAATTCC CGGGTCGACG AGCTCACTAG TCGGCGGCCG CTCTAGAGGA TCCAAGCTCG 60 
GAATTCCG 68 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGCGGATAAC AATTTCACAC AGGA 24 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TGTAAAACGA CGGCCAGT 18

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
CTGCCAGGCT AAAATTACGG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATCACAGACC TTGATGCCAC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GCTGGTATTG TCTGTGATTG GGTC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CATCAGGATT TGCTTGAGGG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TATTGGTCAG GCAAGGGAAG 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GTGTTTGCTC CTCCATGAGC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
CAAGTAGAAG GTCAGGGGTG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
ATAAGTGTCA AGGAGGCAGC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GCAGACTGTT CCATGTGATG 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
ATGTACCTGT TCTTGGAGCC 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

ACGTACCTGT TTGAAGCCAC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GGTAAGGACC GCCTAAATCG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
GAAGTGAAAC AAAGTGGGGC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
TTATCCTCCC CATCAGTCAG 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
TCGATTTAGG CGGTCCTTAC 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TGTGGCTTCA AACAGGTACG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GGGTAAGGAC CGCCTAAATC GAATG 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
GAGCCCCAAA AGCATCAATG AGG 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

Ile
Ile
Ala
Leu
Ile
Trp
Phe
Trp
Met
Trp
680
Ile
Ile
Gly
Ile
Cys
Gln
Ile
Asn
Ile
Ile
Ser
Glu
Gly
Glu
Asp
430
Val
Ile
Leu
Lys
690
Leu
Arg
Pro
Leu
695
Asn
Ala
Ile
Thr Ala Ser Gly

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: None

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Ala Asn Ser Ser Val Pro Pro Ile Thr Val Asn Ala Lys Met Asn Lys

1 5 10 15

Asp Val Asn Ser Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Asp Asn Gly Ala Gly Ala Asp Ser Phe Lys Asn Asp Gly Val Tyr Ser 

1 5 10 15

Arg Tyr Phe Thr Ala Tyr Thr Glu Asn Gly Arg Tyr Ser Leu Lys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Val Arg Ala His Gly Gly Ala Asn Thr Ala Arg Leu Lys Leu Arg Pro 

1 5 10 15

Pro Leu Asn Arg Ala Ala Tyr Ile
20 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ser Leu Pro Leu Pro Asp Gln Tyr Pro Pro Ser Gln Ile Thr Asp Leu

1 5 10 15

Asp Ala Thr Val His Glu Asp Lys Ile Ile Leu Thr Trp Thr Ala Pro

20 25 30 

Gly Asp Asn Phe Asp Val Gly Lys 
35 40 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Tyr Asn Glu Asp Gln Pro Phe Tyr Arg Ala Lys Ser Lys Lys Ile Glu

1 5 10 15 

Ala Thr Arg Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Leu Ser Arg Ala Cys Arg Ile Asp Ser Thr Thr Lys Leu Tyr Gly Lys
Asp Cys Gln Phe Phe Pro Asp Lys



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

Lys Ser Ser Asp Glu Arg Asn Thr Leu Met Ala Gly Leu Pro Thr Tyr

1 5 10 15

Pro Leu Gly Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: None

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:

Glu Ile Asp Glu Asp Thr Gln Thr Thr Leu Glu Asp Phe Ser Arg

1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Met His Thr Glu His

1 5 10 15

His His His His His

20



